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1 Introduction 

I will summarize the basic ideas of the Collins-Soper-Sterma 
(CSS) approach to TMD factorization [TJ[31[3J|3J[3 , and the 
updated version in Ref. [6j Chapts. (10,13,14)], for formu¬ 
lating transverse momentum dependent factorization. In 
this context, “transverse momentum dependent” (TMD'' 
refers to QCD treatments of inclusive observables wi 
at least one perturbatively hard scale Q, and a separa 
transverse momentum that can vary from 0 to order ( 

It also refers to related objects like TMD parton distrib 
tion functions (PDFs) and TMD fragmentation functio: 
(FFs). Studies of TMD objects have been driven by a c 
verse set of motivations that include testing perturbati' 
QCD, probing hadronic structure, and providing calcul 
tions for general particle and nuclear physics experimeni 

A trend in TMD physics is that initial intuition fr 
quently needs to be revised as quantum field theoretic 
details come into focus. I will build on this observation 
motivate greater general interest in TMD physics as < 
arena for unpacking foundational QCD concepts. 

In its complete form, a TMD factorization theore 
should apply to a variety of processes and allow for cor 
parisons between them. But for a concrete discussion, 
begin by considering Drell-Yan (DY) scattering. The pr 
cess is shown in cartoon form in Fig. [1] In the center 
mass frame, a hadron enters from the left with large “pluo 
momentum and another enters from the right with large 
“minus” momentum. An antiquark from one hadron an¬ 
nihilates with a quark from the other, and the resulting 
virtual photon splits into a lepton-antilepton pair 

with total four-momentum q. The relevant observable is 

( 1 ) 

d^gdC ’ ^ ’ 

where dl7 is the phase space of the l'^l~ pair. Dependence 
on the total transverse momentum qx of the pair 




(b) 

Fig. 1. A cartoon depiction of Drell-Yan scattering with an 
antiproton entering from the left and a proton entering from 
the right. The explosion is the hard part. In (a), the gluon is 
taken to be part of the proton wavefunction, making this a 
Type I picture in the language of Ref. [7]. In (b) the gluon is 
associated with hard perturbative QCD radiation, making this 
a type II picture. (Or course, there are many more gluons than 
just the one shown explicitly.) 
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is of special interest at small values and when possible 
correlations to spin dependence are included. Descriptions 
of Eq. dl]) naturally generalize to the production of other 
colorless bosons Z,Higgs). 

Other processes where TMD factorization applies in¬ 
clude semi-inclusive deep inelastic scattering (SIDIS) and 
l'^l~ annihilation into back-to-back jets or hadrons. In 
each of these, there are two large forward and backward 
directions analogous to the large positive and negative ra¬ 
pidities of the proton and antiproton in Fig. [TJ and each 
has a hard scale Q and transverse momentum that be¬ 
comes sensitive to intrinsic transverse momentum when 
small. 

The question of how to describe TMD cross sections 
like Eq. dD) in QCD has traditionally been approached 
from two rather opposite conceptual starting points, which 
Feynman, Field and Fox [7] (FFF) identified early on, call¬ 
ing them “Type I” and “Type II” descriptions. They dif¬ 
ferentiated between the two pictures as follows: 

Type I: A description in terms of colliding bound states 
is taken very literally, with the total transverse mo¬ 
mentum of the pair assumed to originate entirely 
from the nonperturbative intrinsic motion of partons, 
such as that associated with an incoming “wavefunc- 
tion.” (See Fig. [TJa).) 

Type II; The transverse momentum of the pair is 
understood to originate from the radiation of gluons 
before and after the collision. In a type II picture, it is 
assumed that all or most of such radiation is describ- 
able using small as methods. (See Fig. [TJb).) 

Note the similarity between Fig.[T]of this paper and Fig. 6 
of FFF. In reference to their Fig. 6, FFF write 

“There has been much speculation about how much 
of the dimuon kr spectra shown in Fig. 7 [DY data 
from Ref. [sjiniiiniin]] is due to the wave function 
(Type I) and how much is explained by QCD per¬ 
turbation calculations (Type II).” 

To some extent, this quote remains true today, though the 
situation is in some ways even more interesting after the 
advances in QCD theory of the intervening decades. 

Although it is a useful starting point for general dis¬ 
cussions of TMD physics, the type I / type II dichotomy 
is rather artificial, and a sharp distinction becomes elu¬ 
sive when one tries to formalize it. Figure 6 of FFF al¬ 
ready exhibits some of this difficulty. Compare, for ex¬ 
ample, the relatively large intrinsic transverse momentum 
of (fcx) 848 MeV found by FFF with other expec¬ 
tations suggested around the same period, such as the 
(fcx) ~ 300 MeV proposed in Ref. [H] on the basis of a 
parton model. Also, given what is now understood about 
nonperturbative evolution, a significant fraction of the 
transverse momentum width is likely actually due to non¬ 
perturbative radiation that does not fit into either a Type 
I or Type II category in an obvious way. 

In type I oriented approaches of the past, one typically 
finds discussions of TMD parton models, TMD PDFs, and 


effects from nonperturbative wavefunctions. It is an ap¬ 
proach that is used to address problems in hadron struc¬ 
ture, such as the orbital angular momentum composition 
of hadrons. See, for example. Ref. m from this collection 
for a review. 

By contrast, in type II oriented approaches one typ¬ 
ically finds discussions of fixed high order calculations 
and/or gx-resummation, with nonperturbative transverse 
momenta only entering in the form of small corrections. 
Applications are to be to high energy physics and very 
large hard scales, where nonperturbative effects tend to 
get washed out. 

One important development of roughly the last decade 
is a trend toward a convergence of Type I and type II 
oriented approaches into a single TMD formalism. This 
will be an organizing theme for this review. 

In Sect. [2] I will expand on the motivations for TMD 
physics that came mainly from the hadron structure per¬ 
spective, and which has been traditionally seen as a more 
type-Toriented perspective, while in Sect.|3]I briefly men¬ 
tion the type-II-oriented perspective. In Sect.|4]I will give 
an overview of the development of full QCD approaches 
to TMD factorization. In Sect.O I will summarize the ba¬ 
sic formulas of TMD factorization as they now stand. In 
Sect. [HI I will discuss solutions to the evolution equations, 
and in Sect. [7] I will end with concluding remarks. 


2 Type I Approaches: TMD functions in 
hadronic structure and nonperturbative 
physics 


For describing Type I physics, the parton model is of¬ 
ten generalized to include an intrinsic transverse momen¬ 
tum for parton distribution functions and fragmentation 
functions - see, for example. Ref. [HI Eq. (9.13)] and the 
surrounding discussion. In the DY case, for example, one 
writes 


dcr 


33 ' 


X 


d^fcx Fj/AixA, fcx, SA)Fji/B{xB,qT — fcx, Sb) 


+ p.s.c.. 


( 2 ) 


The basic structure is analogous to collinear factorization, 
but the usual collinear parton distribution functions, with 
their dependence on longitudinal momentum fractions xa 
and XB^ are replaced by TMD PDFs with additional de¬ 
pendence on the intrinsic transverse momenta (kx.^i = fcx 
and kx.B = qr — k^). Ff /h(xh, k--£, Sh) labels a probabil¬ 
ity density for finding a parton of flavor / inside a hadron 
of species F[. The overall hard part is T-Ljj'■ It is usually 
set equal to the zeroth order partonic vertex in a type I 
approach. Possible spin dependence is indicated by Sh- 
The “p.s.c.” means “power-suppressed corrections.” For 
processes like SIDIS and e+e" annihilation into back-to- 
back hadrons, formulas analogous to Eq. @ are needed, 
but with TMD fragmentation functions. 
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From here forward, I will refer to TMD parton distri¬ 
bution functions (TMD PDFs) and TMD fragmentation 
functions (TMD FFs) generically as “TMDs.” 

The sensitivity to an intrinsic transverse direction makes 
TMDs natural objects of interest for spin physics because 
new nonperturbative correlations become possible relative 
to the collinear leading twist case. As early as 1978, Cahn 
used TMD PDFs to describe azimuthal asymmetries in 
SIDIS |15j . Intrinsic transverse momentum can become 
correlated with various components of spin and induce 
asymmetries in the cross section. In 1990, Sivers proposed 
a now famous TMD mechanism m for explaining the 
large transverse single spin asymmetries in experiments 
like [T71[T5l[roi[^[^[^[^ 0 In the polarized TMD PDF 
called the Sivers function, the transverse momentum of an 
unpolarized quark becomes correlated with the transverse 
spin of its parent hadron. 

If one takes a basic number density operator very lit¬ 
erally when defining the TMD PDFs, then it appears that 
time-reversal and parity invariance (TP) in QCD requires 
the Sivers function to vanish. This was pointed out by 
Collins in Ref. m, where he also proposed an alterna¬ 
tive TMD mechanism for generating asymmetries based 
on fragmentation in the final state. In the “Collins effect,” 
the transverse spins of the quarks involved in the hard col¬ 
lision become correlated and remain entangled over long 
time and distance scales. This results in an azimuthal 
asymmetry in the distribution of unpolarized hadrons. 
The TMD called the Collins fragmentation function de¬ 
scribes the likelihood that a transversely polarized quark 
fragments into a particular unpolarized hadron with some 
small transverse momentum relative to the parent quark. 

Over the next decade, the role of TMD functions in 
describing spin effects grew more prominent in the litera¬ 
ture. In 1995/1996, Mulders and Tangerman classified the 
leading power azimuthal and spin dependencies allowed 
in TMD functions by parity and rotation invariance m 
HH]. The “Boer-Mulders” TMD PDF [53] is one particu¬ 
lar TMD function that illustrates the interesting fact that 
transverse spin and momentum correlations can be impor¬ 
tant at leading power even in unpolarized scattering. 

If the hard scales under consideration cover only a 
small range, then a reasonable strategy for phenomenol¬ 
ogy is to maintain a purely parton model description, and 
for a long time this was the most common approach in 
most applications of TMD functions to spin physics and 
hadron structure. (Reference [30] is an example of this 
type of application to phenomenology.) Usually, the TMD 
is written as a collinear function with a modulating factor 
for TMD dependence: 

Fg/pix, kp) = fg/p{x)e{kT) , (3) 

and then requiring 

J cPkr Fg/p{x, kr) = fq/pix) . (4) 

^ See Ref. |24II25| for some experimental overviews of TMD 
physics. 


The modulating factor 0(fcr) is usually taken to be Gaus¬ 
sian and x and z independent. See, for example. Ref. [141 
Eq. (9.14)]. 

While the role of nonperturbative transverse momen¬ 
tum was becoming less of a focus in many of the appli¬ 
cations of gx-resummation to high energy physics, efforts 
like those summarized in the last few paragraphs, partic¬ 
ularly when applied to spin physics, focused on intrinsic 
transverse momentum as a way to study non-trivial as¬ 
pects of fundamental QCD. 

However, the work discussed in this section so far was 
mostly done in the context of a TMD parton model or 
parton-model-like description. The situation becomes very 
interesting when going beyond a TMD parton model pic¬ 
ture. In incorporating perturbative QCD, one expects the 
TMD functions to acquire scale dependence through renor¬ 
malization group (RG) dependence, analogously to collinear 
PDFs. Perturbative QCD predicts a large transverse mo¬ 
mentum dependence that is power-like rather than Gaus¬ 
sian. In 1991, Chay, Ellis and Stirling used TMD PDFs to 
describe azimuthal asymmetries in SIDIS, with a match¬ 
ing to perturbative behavior at large transverse momen¬ 
tum m- A more recent discussion of the matching of 
large and small transverse momentum regions in TMD 
functions is in Ref. [32] . 

Much of the work of the full QCD approach began very 
early, but was not commonly incorporated into type-I non¬ 
perturbative physics studies like those discussed above un¬ 
til relatively recently. This will be discussed in more detail 
in Sect. 0] 


3 Type II physics and collinear factorization 

In traditions that approach observables like Eq. o from 
a more Type-ITlike perspective, one typically starts from 
calculations of large transverse momentum in perturbative 
QCD, and attempts to extend the description to smaller 
qt- One works entirely in collinear factorization so that the 
only nonperturbative objects that appear are the collinear 
PDFs and FFs. One example is transverse momentum re¬ 
summation ElIM], which incorporates large logarithms of 
Qt/Q to all orders in Og. These approaches typically as¬ 
sume Aqcd <C gx ^ Q and ignore intrinsic nonperturba¬ 
tive transverse momentum effects, or at least refrain from 
accounting for them with detailed QCD considerations. In 
certain practical circumstances, such as at very high en¬ 
ergies and large hard scales, these approaches may be suf¬ 
ficient, because sensitivity to nonperturbative transverse 
momentum become suppressed in the limit of infinite Q, 
even down to small gx ESj. The advantage is that calcula¬ 
tions can be done without needing nonperturbative input 
that is often unknown or poorly constrained. However, 
working from a purely type H approach means that one 
cannot directly connect results to lower Q measurements 
where nonperturbatuve transverse momentum definitely 
becomes important, and one also abandons the study of 
nonperturbative transverse structure itself via the extrac¬ 
tion of TMD functions. Of course, in a full QCD treat- 
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ment resummation-like results should emerge naturally in 
the large Q limit. 

For more discussion of (/T-resummation, see for exam¬ 
ple Ref. [141 Chapt. 9.0] and Ref. [SHI Chapt. 6] and refer¬ 
ences therein. See also the cautionary remarks regarding 
(jT-resummation techniques in Ref. [SI Sec. 13.13.5]. 


4TMDs and QCD 

Notions of intrinsic transverse momentum and TMD func¬ 
tions appeared early on in considerations of full QCD [371 
[33] ■ It is clear from the FFF discussions that a com¬ 
plete QCD formalism, like a TMD factorization formalism, 
would involve combination of type I and type II physics. 
In fact, the CSS formalism is a TMD factorization, though 
the way it was originally presented and subsequently ap¬ 
plied possibly discouraged its rapid adoption in areas like 
spin physics. An early exception to the tendency to ne¬ 
glect TMD evolution in hadron structure phenomenology 
are the papers of Boer [331I3D] . These are the first cases 
I know of where CSS-style evolution is applied directly 
to the phenomenology of single-spin asymmetries and az¬ 
imuthal asymmetries directly identified with TMDs in the 
type-I sense of Sect. [2] See also Refs. [4T1I42] . CSS style 
treatments similar to Refs. [33113311351133] for unpolarized 
SIDIS were extended to the polarized case in Refs. m- 

An extra complication with TMD parton model ap¬ 
proaches is that definitions that use the naive number 
density operator contain extra “light-cone” divergences. 
The light-cone divergences remain if light-like Wilson lines 
are used to enforce gauge invariance in TMD definitions, 
even if infrared and ultraviolet divergences are regulated. 
Unlike the normal infrared divergences, which signal the 
onset of genuine nonperturbative physical phenomena in 
the region of soft physics, the light-cone divergences are 
artifacts of the approximations that separate the cross sec¬ 
tion into different factors for widely separated regions of 
rapidity. (They are partly artifacts of ignoring the role of 
soft gluons.) Light-cone divergences describe gluons with 
infinite rapidity in the direction opposite that of the par¬ 
ent hadron. For the approximations to be consistent with 
factorization, therefore, the light-cone divergences need to 
be regulated and dealt with in some way. 

In 1981, Collins and Soper (CS) introduced operator 
definitions for TMD PDFs and TMD FFs [T] Eqs. (2.1) 
and (4.9)], and these definitions remain adequate for many 
purposesjj The TMD FFs from Ref. [1] were used in the 
derivation of the CS equation in Ref. [^ - see [lEq. (3.8)]. 
Light-cone divergences were handled by defining TMDs in 
a non-light-like axial gauge. Early discussions of light-cone 
divergences and the axial-gauge-method of dealing with 
them are discussed in the pioneering work of Refs. [371133] . 
(These papers also contain useful references for many of 
the now standard techniques, such as the use of non-light- 
like axial gauges.) In particular. Ref. [371133] showed how 
TMD parton correlation functions acquire dependence on 

^ The term “decay function” was used in Refs. m rather 
than “fragmentation function.” 


an auxiliary scale C, = (2P-n)^/(—n^), where P is the pro¬ 
ton four-momentum and n is a non-light-like gauge fixing 
vector 0. The dependence on ( gives rise to loga¬ 

rithmic scaling violations in a complete cross section. For 
the Sudakov form factor, a derivation of the correspond¬ 
ing evolution in perturbative QCD was given in Ref. |49) . 
and for the CS TMD functions in Refs. [Il[2]. An analysis 
of TMD PDFs in structure functions is given in Ref. |50] . 
The ( scale, with its connection to the gauge fixing vector 
n, can be thought of as a cutoff on light-cone divergences. 
The CS equation gives the evolution with respect to the 
direction of the gauge fixing vector n and restores predic¬ 
tive power which would otherwise be lost by having an 
extra parameter 0 

The TMDs, defined in a non-light-like axial gauge as in 
Refs. [Hd], use the same auxiliary scales ( as in Ref. [33] . 
Since they make explicit use of the gauge fixing vector in 
treating light-cone divergences, the early definitions of the 
TMDs in Refs. [T|[3] were not gauge invariant. Treatments 
in Ref. [31|3] did propose gauge invariant definitions for 
the TMD PDFs, using non-light like Wilson lines to reg¬ 
ulate light-cone divergences, and similar procedures have 
come to be preferred. However, the CSS formalism for 
hadron-hadron scattering, as it was presented in Ref. [5], 
was based on the earlier derivation of TMD factoriza¬ 
tion for e“'"e“ annihilation into back-to-back jets with the 
non-light-like axial gauge definitions of Ref. [T] . The orig¬ 
inal definitions of the TMD functions are also modihed 
in Ref. [5] relative to those of Refs. m such that the 
overall hard factor is unity and there is no explicit U{b) in 
the factorization formulaQ (See the footnote at Eq. (3.3) 
of Ref. [5] and the discussion that begins with that equa¬ 
tion.) 

In most derivations of factorization, especially when 
initial state hadrons are involved, determining a method 
for dealing with gluons in the “Glauber” region is a major 
step toward the ultimate factorization, and it is the source 
of many of the subtleties that affect the TMD definitions. 
The Glauber region describes gluons whose longitudinal 
momentum components vanish while the the transverse 
components remain small (say ^ Aqcd) but fixed. The 
approximations that would normally allow one to apply 
Ward identities and eikonalize soft and collinear gluons 
fail in the Glauber region. So Glauber gluons threaten 
to spoil factorization [52] @ Therefore, any derivation of 
factorization (either collinear or TMD) must show that 
Glauber region contributions either cancel in a sum over 
all graphs, or are avoided by contour deformations in the 
integrals over gluon momenta. Observables that involve 
collisions between two hadrons are especially challenging 
because gluon exchanges between parton spectators are 
“pinched” in the Glauber region, blocking straightforward 

® Early work such as Refs [MIEE did not use the termi¬ 
nology “TMD,” which became common only later. 

U{h) is related to the well-known soft factor, often called 
S{b). 

® An analogy can be made between Glauber gluon exchanges 
at the partonic level and multiple nucleon interactions in a 
Glauber model of nucleus-nucleus scattering. 
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contour deformations. For inclusive DY, the Glauber re¬ 
gion contributions can be shown to cancel for spectator- 
spectator interactions [iiiiiiisiisg. See also [SS] and 0 
Chapts.14.3-14.5] for more recent pedagogical overviews^ 

In the TMD case, many of the steps for showing a can¬ 
celation of Glauber region effects, especially in spectator- 
spectator interactions, carry over from the inclusive collinear 
case. GSS used this in Ref. [5] to carry TMD factorization 
results originally obtained for l'^l~ annihilation over to 
the DY case. 

The details of how Glauber regions are avoided by con¬ 
tour deformations are closely connected to establishing 
good TMD definitions [35]. The corresponding subtleties 
are associated with many of the nonintuitive aspects of 
TMD factorization (see the discussion of the Sivers effect 
below). 

The original GS definitions in Refs. m, and subse¬ 
quent definitions in Ref. [5] that are based on them, are 
sufficient for capturing much of the physics needed to set 
up basic factorization theorems. The GSS presentation 
is the starting point for successful applications to phe¬ 
nomenology, especially in unpolarized Drell-Yan-type pro¬ 
cesses and e“''e“ annihilation into back-to-back hadrons 
and jets. It was extended to the SIDIS case in Refs. |43l 
I44| . The CSS formalism in this or roughly similar forms 
has been widely applied. See, for example. Refs. iMiisniisii 
l45U46U621l63ll64l[65] . It is often the most convenient way of 
formulating cross section calculations for some practical 
calculations, especially in contexts where there is com¬ 
paratively little sensitivity to, and/or interest in, the pre¬ 
cise details of nonperturbative transverse momentum. It 
also clearly displays the underlying simplicity of solutions 
to the TMD evolution equations, and it makes very ex¬ 
plicit the matching to collinear correlation functions and 
collinear factorization in the limits of large gx and Q. 

However, these early definitions had shortcomings that 
made them non-ideal for confronting some of the issues 
discussed in Sect. |2| Some of the problems are mainly or¬ 
ganizational. One possibly confusing aspect of the presen¬ 
tation in Ref. is that the central result - that the for¬ 
malism is, first and foremost, a TMD factorization formal¬ 
ism - appeared only later in the paper, in Eqs. (5.2,5.8). 
The first equation of the paper, which might appear to a 
reader to be the main result, contains no explicit intrinsic 
nonperturbative transverse coordinate (6t) dependence. 
Perhaps as a result, the GSS formalism is now frequently 
referred to as a resummation. However, it was intended 
to be more powerful than resummation methods; the GSS 
formalism was meant to be a true TMD factorization for¬ 
malism, with a valid pQCD perturbation expansion of the 
hard part and renormalization group equations for all 6 t, 
even in the limit oi bx ^ oo where fex-dependence is non¬ 
perturbative (Collins, private communication). 

If the hard scale is extremely large, the effects of per¬ 
turbative radiation becomes so dominant that all non¬ 
perturbative transverse momentum effects are washed out 

® Reference [51 pg. 12] notes an interesting similarity between 
Glauber cancelations and the AGK unitarity cancelations m 
in Regge theory. 


even for gx ^ 0 [35]. The process gradually becomes en¬ 
tirely a type H process. In this sense, questions of whether 
interesting nonperturbative TMD phenomena like the Sivers 
effect are washed out at larger Q are correlated to the 
question about whether Q is large enough that purely 
perturbative calculations of transverse momentum depen¬ 
dence are reliable to within a desired accuracy. In Ref. [5] , 
an estimate was given for the scale where the DY cross 
section becomes completely insensitive to nonperturba¬ 
tive transverse momentum dependence, and a value of 
Q « 10 PeV was found. Given the relative insensitivity to 
nonperturbative transverse momentum observed at facili¬ 
ties like the Tevatron [66ll67] , it may be that this estimate 
needs to be revised downward. However, even at scales as 
large as weak boson masses, the nonperturbative contri¬ 
bution seems to be important for currently desired levels 
of accuracy and precision [SilMlES]. 

Even when nonperturbative 6x-dependence is included, 
the factorization formulas of Ref. [5] do not immediately 
resemble TMD parton model formulas like Eq. ([2]). The 
connection between TMD functions (including the many 
subtleties involved in defining those functions) and the 
evolved factors used in actual cross sections becomes some¬ 
what indirect. An example of how this can lead to practical 
consequences can be seen in discussions of the Sivers func¬ 
tion. In 2002, Brodsky, Hwang, and Schmidt (BHS) [7T] 
used an explicit model calculation to demonstrate that 
final state interactions in SIDIS can give a transverse sin¬ 
gle spin asymmetry at leading power in Q. This was pre¬ 
sented as a conflict with factorization itself; they argued 
that the effect cannot be associated with parton densi¬ 
ties or fragmentation functions. But Collins showed [73] 
that in fact the result is consistent with TMD factoriza¬ 
tion. Instead the BHS calculation effectively demonstrated 
that a Sivers-like effect is non-vanishing. While the TP- 
invariance argument m mentioned earlier appeared to 
show that the Sivers function vanished, there is a loop¬ 
hole arising from the Wilson lines in a gauge-invariant 
definition of TMD functions. When applied to those defi¬ 
nitions, TP invariance shows that the Sivers function used 
in the Drell-Yan process has the opposite sign to the one 
in SIDIS, not that it vanishes. Thus, what might at first 
seem like a contradiction between factorization and the 
direct calculations of Ref. m is largely due to an unclear 
connection between TMD definitions, as they had been 
presented up to that time, and their origins in a factoriza¬ 
tion derivation. The TMD factorization derivation relies 
on contour deformations that avoid the Glauber region, 
and the way this procedure gets modified to factorize dif¬ 
ferent processes introduces a process dependent sign and 
the associated Wilson line direction. 

Another issue with the factorization as organized in 
Ref. [5] is that perturbatively calculable process depen¬ 
dence was moved out of the hard part and into the factors 
that most resemble the evolved TMD functions; the hard 
part is simply 1. This gives a simple arrangement in some 
sense. But in a truly factorized TMD formulation, one ex¬ 
pects all fixed order perturbative process dependence to 
be explicitly factorized. This was especially clearly pointed 
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out by de Florian and Grazzini in Refs. Catani et 

al. subsequently developed an approach to organizing non- 
universal contributions into a separate hard factor [Tam 
IZZUZSIIZH]- But with well-defined TMD definitions, includ¬ 
ing a specification of a renormalization scheme, the hard 
part is uniquely determined automatically from Eq. 

I have listed several issues that highlight the impor¬ 
tance of considering TMD definitions in detail and ex¬ 
amining their origins in a factorization theorem. The dis¬ 
cussions of the last few paragraphs also motivate natural 
conditions for optimally defining TMD PDFs and FFs: 
a.) They should have definite operator definitions that 
are separately gauge invariant and account for any in¬ 
stances of nonperturbative process dependent signs, b.) 
Apart from the process dependent signs, they should be 
universal and provide a prescription for calculating hard 
parts, c.) They should combine perturbative and nonper¬ 
turbative information in a way that allows one to simulta¬ 
neously extract maximum advantage from the universality 
of nonperturbative parts and from the perturbative calcu- 
lability of small coupling parts. 

In fact, the details of how to precisely define the TMDs 
becomes a guiding question for setting up the newer TMD 
definitions in next section. The universality of the TMD 
functions needs to be modified from the parton model in 
at least two respects: 1.) there is process dependence in the 
sense of dependence on a hard scale Q via evolution (just 
as in collinear factorization) and 2.) there is dependence 
on the direction of the Wilson line. The second of these 
is a novel modification of the more familiar universality 
concept encountered when dealing with collinear PDFs. 

The early 2000s saw growing attention paid to the issue 
of precisely defining TMD functions, largely due to grad¬ 
ually increasing recognition of the relevance of predictions 
like the changing Sivers function sign and the inadequacy 
of light-cone Wilson lines for definitions. See, especially. 


Refs. . Refer¬ 

ence [82] contains a useful summary of the issues as they 
stood approximately a decade ago. These considerations 
led to the formulation that will be discussed in the next 
section. 


5 Basic statement of TMD factorization 

The organization of TMD factorization theorems and their 
TMD definitions, as it will be presented in the remaining 
sections, is based on Ref. [B]. This formulation is intended 
to satisfy the criteria listed at the end of the last section. 

5.1 Factorization formulas 

Expressions with TMDs are most easily expressed in trans¬ 
verse coordinate space: 

Fj/p{x,kT,SA]fJ-XPDF) = 

J Ej/p(a;, 6T,5'A;Ai, Cpdf) , (5) 

DH/j{z,zkT, Cff) = 

J Cff) • (6) 

Note the convention to write the left side of Eq. m as 
a function of zk^ rather than kp. The nonperturbative 
behavior associated with small qt is associated with large 
6 t behavior in the coordinate space functions. 

TMD factorization theorems are best established the¬ 
oretically for the classic electromagnetic processes of DY, 
SIDIS and the annihilation of e“'"e“ pairs into a back-to- 
back hadron pairs. The basic statement of TMD factor¬ 
ization for these three processes is [Bj: 


(/i/Q; asiX) J d^br bx, C, X bx, Sb; QVC, X + Ydy , (7) 

c^siX) J d'bx bx, 5^; C, X Db/j{za, bx, Sb; QVC, X + Ysims , (8) 

J d^bT Da/XzaXtXa-X,^ DB/j{zB,bT,SB;QV(:,X +>;+e- • (9) 


The left side is a cross section differential in at least 
transverse momentum qx, defined in an appropriate refer¬ 
ence frame, and the “• • • ” represents possible dependence 
on other kinematic variables like rapidities. The first term 
in each equation has a structure like that of a TMD parton 


model. For example, use Eq. m to write 

J <PbT Fljl{xA,bT,SA;C,X^ 

^ ^Jb{xbXtXb; Q^K,h) 

= J (PViTA J d^kT,2 F^jl{xA,kTA,SA;C,Xx 

X Fj^I{xb, kT^2, Sb; — kx,! — kx,2), 

( 10 ) 
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and compare with Eq. (12). In each of Eqs. (O-dll), there 
is a convolution of two TMD functions, each being ei¬ 
ther a TMD PDE (labeled by E) or a TMD FF (la¬ 
beled by D). Capital letters are used here to distinguish 
TMDs from collinear PDFs and FFs. The first term is 
called the “bP-term.” The bP-terms differ from a parton 
model picture by the presence of an explicit hard fac¬ 
tor and by the appearance of evolution 

scales C and /r. The scales are exactly arbitrary, but in 
applications they should be set to values of order and 
Q respectively to enable well-behaved perturbative calcu¬ 
lations. The “[±]” superscripts represent process depen¬ 
dent Wilson line directions, following the notation of [83]. 
The [-b] means future-pointing and the [—] means past¬ 
pointing. The Sa and Sb denote possible polarization de¬ 
pendence. From here forward, the power suppressed cor¬ 
rection term will be assumed implicit. 

Transverse momentum dependence can be factored into 
separate TMD functions only when <C Q. For very 
large qt, a correction is needed, though it is calculable in 
pure collinear factorization. The correction is indicated by 
the last term in each equation, called the “F-term.” 

It is probably best to apply the term “TMD factor¬ 
ization theorem” to the complete collection of formulas in 
Eqs. da-dll), including the set of universality properties of 
large distance parts and the F-terms, rather than to any 
one equation alone. 


5.2 Definitions 


Each TMD function is defined in terms of quark and gluon 
field operators. I will use SIDIS as a reference process for 
setting up the definitions, with the directions of the incom¬ 
ing proton and outgoing hadron defining the large “-I-” and 
” directions respectively. Define space-like directions by 
the vectors 


«A(yA) = (I,-e nB( 2 /s) = (-e^“,l,Ot). 

( 11 ) 

These approach light-like plus (minus) vectors when yAivs) 
approach -|-(—)oo. 

To build up TMD definitions, one first needs to define 
a soft factor. Define the Wilson line from a; to oo along 
n in terms of a bare coupling and the bare gluon field 
operator: 


bF(c 


i) = P exp 


-^9o 


ds n ■ Aq{x + sn)t° 


( 12 ) 

This Wilson line is in the color triplet representation, with 
ta being the SU(3) generators in the fundamental rep¬ 
resentation. P is a path-ordering operator. Following la 
Eq. (13.39)], the soft factor is the vacuum expectation 
value of a Wilson loop: 


J 


S(o){bT, Ua, Ub) 

= ^(0|bF(5T/2, oo; nB{yb))\a ^ (^t/2, oo; nA(ya))asW(-bT/2, oo; nB(yb))j3j'fV (-bx/S, oo; nA( 2 /a))] 5 ^| 0 )No s.i.. 

(13) 


The “No S.I.” means Wilson line self-interactions are 
temporarily excluded, in addition to interactions with trans¬ 
verse Wilson lines at light-cone infinity. The Greek letters 
are color triplet indices. An analogous soft factor can be 
defined in an octet representation. The soft factor is de¬ 
signed mainly to describe QCD radiation of gluons that 
are both nearly on-shell and at central rapidities. 


For the TMDs, one would like functions that are rem¬ 
iniscent of number the densities with light-like Wilson 
lines, but these suffer from light-cone divergences as previ¬ 
ously discussed. To regulate them while maintaining gauge 
invariance, one starts by defining TMDs with non-light- 
like Wilson lines. These are called the “unsubtracted” defi¬ 
nitions. For quark TMD PDFs and FFs, respectively, they 
are 


J 
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^unsub,[+](^^^T^5';„^(y^)) = Trc.n / —e x 


dw 


—ixP^ 


27r 


X {P,S\il}f W (^j,oo;nB(i/b)y oo; nB( 2 /b)) ipf 1^^,'S')no s.i., 


(14) 




dt(;+ ii.- 


27r 


gifc ^ 


X (0|7 W oo; UAiVa)^ i’f \H, S, X){H, S, X\ipf W UAiVaij |0)no s.i. ■ (15) 


(See [SJ Eqs. (13.108,13.41)] and associated discussions.) 

The definitions here do not use bare fields because the 
renormalization factors will be included in the final sub¬ 
tracted definition. The Trc,r> denotes traces over color and 

Dirac indices. The definition for bT,S; riBiUb)) 

is exactly the same as in Eq. (imi but with the main Wil¬ 
son lines past pointing rather than future pointing. 

The final definitions to be used in Eqs. 0-® are de¬ 
fined with the soft factors included, ordinary renormaliza- So 
tion, and in the limits of infinite rapidities for the main 
Wilson lines: 


The sensitivity to yg in each TMD is contained in the 
auxiliary parameters Cpdf and Cff: 


Cpdf = x "^, 

(18) 

Cff = . 

(19) 

CpdfCff = ■ 

(20) 


Pfypix,bT] fi, (pdf) 


= lim F'^J^'^’^^\x,bT;nB{yB))x 

>-+oo 

ys^-oo 


_ S( 0 ){bT', yA,Vs) _. 

\ S(^o){bT]ys,yB)S(o){bT',yA,yB) 

X UV ren, 


(16) 


Dh/ fiz,bT; y,CFF) 

= 1™ D'^/f'°{z,bT-,nA{yA))'x 

VA^ + OO 


S(o){bT,ys,yB) 


\ S(^o){bT;yA,ys)S(o){bT;yA,yB) 

X UV ren. 


(17) 


(See 0 Eqs. (13.106,13.42)] and associated discussions.) 
The soft rapidity yg now regulates light-cone divergences. 
The factor “UV ren” is an instruction to apply UV reno- 
ramlization and remove the UV regulator after the limits 
of yA{yB) —f +(—)oo have been taken. In Eqs. (I16I17I1 . 
Wilson line self-energies and interactions with transverse 
Wilson lines at light-cone infinity can be included now be¬ 
cause they cancel between the soft factors and the main 
Wilson line in the unsubtracted TMDs. Thus the “No S.I.” 
has been removed. 


The ^-parameters carry the memory of the need to regu¬ 
late Wilson lines to define separate TMD functions. 

For the Drell-Yan and e+e” processes in Eqs. (I7I9L the 
TMD PDFs (EEs) for hadrons moving with large minus 
(plus) momentum have the same definitions but with the 
plus and minus directions reversed, and corresponding re¬ 
placements of UAijiB) with nB{nA)- 

The notational complexity in Eqs. (I16I17I) maybe dis¬ 
guises an important simplicity in these definitions. Diver¬ 
gences are removed by multiplying unsubtracted TMDs 
by factors with relatively simple and universal proper¬ 
ties. This is closely analogous to ordinary ultraviolet (UV) 
renormalization, where bare operators are renormalized 
by multiplying with renormalization factors. It is useful 
to define a notation that emphasizes this analogy. First 
write 


ZcsiVs) = 

^CsiVs) = 


_ S(o){bT;+oo,ys) _ 

\ -5(0)(bT;?/s,-oo)^(0)(bT;+oo,-oo) 


S(o)ibT]yg,-oo) 


\ -5(o)(bT;+oo,ys)<5(o)(bT;+oo,-oo) 


, ( 21 ) 


• ( 22 ) 


The infinite plus and minus rapidities in the arguments on 
the right side should be taken to mean that one applies the 
limits of infinity plus and minus rapidities in combination 
with whatever ZcsiUs), ^csiVs) multiply on their left. 
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Then Eqs. (ITHl) - ([T71l are 
Cpdf) 

= FjJ^p'^'^~^\x,bT;nB{-oo))Zcs(.ys)ZpuF{fJ-)Z2{fJ,), 

(23) 


-D_f///(z, hT;/i,CFF) 

= D'^f^;f{z,bT;nA{+oo))Zcs{ys)ZMf^)Z2{f^), (24) 


"fK and 7j,pDF are anomalous dimensions for K{b'Y] and 
the TMD PDF respectively^ At small 6 t, 1/&t becomes a 
hard scale and the individual TMD PDFs can be expanded 
in a perturbative series in terms of collinear PDFs using 
an operator product expansion (OPF): 


Fj7ff(a;,&T;C,Ai) = X] / ^ &t; C, A*, ^^(/r)) 


where the products are defined to include first the lim¬ 
its of 2 /a(2/b) —t +(—)oo, and then the application of UV 
renormalization with removal of UV regulators. The fac¬ 
tors .^pdf,ff(m) are the UV renormalization factors for 
the PDF and FF. The factor ^2 (m) is the ordinary UV 
held strength renormalization factor. 

Limits associated with factors of Z should be taken in 
order from left to right; the order of factors in Eqs. (051) - 
((551) is important. The limits of inhnite Wilson line ra¬ 
pidities needs to be applied before UV regulators are re¬ 
moved (i.e., before e —> 0 in dimensional regularization). 
See 0 Sect. (10.8.2)] for a detailed discussion of the non¬ 
commuting limits. 

Note that all the dependence on j/g (or, equivalently, 
Q is in ZcsiUs)- The UV factors and rapidity renormal¬ 
ization factors are independent of the nature of the target 
or measured hadrons. The overall cross section is indepen¬ 
dent of /i and Us, but separate factors acquire and ys 
dependence from Z-factors. In this sense, ZcsiVs) is very 
much like a generalization of a standard renormalization 
factor. 


The Cj/k are Wilson coefficients, and p > 0. Equation (1551) 
is for an unpolarized TMD PDF so I have dropped the [±] 
for the Wilson line direction. TMD PDFs like the Sivers 
function need to be expanded in terms of twist three hard 
coefficients in the small 6 t limit. A set of equations anal¬ 
ogous to (155|) - (158)) holds for TMD FFs. 

The right side of Eq. (155|) is perturbatively calcula¬ 
ble if 1/6 t is much larger than O(Aqcd) and /i is fixed 
to ^ 1/6t- The anomalous dimensions (q:s(m)) and 
PDF(Q:s(/r); 1) are perturbatively calculable as long as 
y is much larger than O(Aqcd)- 

One striking difference between TMD evolution and 
collinear evolution is that Eq. (1261) implies that, in the 
limit of large 6 t, the evolution itself becomes nonperturba- 
tive. Predictive power is maintained because the A'(6 t; p) 
has strong universality, meaning it is independent not only 
of the process, but also the species of hadrons involved or 
any polarizations involved. It is even the same 
for TMDs and FFs. It follows from the universality of 
the ZcsiVs) from the previous section. Testing the strong 
universality of nonperturbative evolution is an important 
part of TMD phenomenology. 


5.3 Evolution 


The ordinary renormalization plus rapidity evolution in 
Eqs. (I551) - (I551) gives a system of evolution equations. (See, 
for example, [6l Eqs. (13.47,13.49,13.50)].) The rapidity 
evolution equation for the TMD PDF is 


dlnFf^^p{x,bT;y,X) 

d\ny^ 


K{bT; y) . 


(25) 


The right side is the CS evolution kernel K{bT', y), which 
is calculable in perturbation theory at small 6 t and using 
~ 1 /5 t as a hard scale. It obeys its own RG equation: 


dK{bT;y) 
d In 


-IK (asitk)) . 


The RG equation for the TMD PDF is 


(26) 


6 Solutions 


The TMD evolution equations only involve products of 
factors in transverse coordinate space, making solutions 
simple to write. In preparation for writing the solutions, 
one needs several definitions associated with the organi¬ 
zation of perturbative and nonperturbative parts. 

For small the right sides of Eqs. dlSD and (1281) 
can be calculated entirely in collinear perturbation theory 
with 1/5 t acting as the hard scale. The only nonpertur¬ 
bative inputs then are the collinear PDFs and FFs. To 
define what one means by “large” and “small” 6 t, one 
must define a cutoff scale 6max- Above 6max, one allows 
for nonperturbative fex-dependence, while below fomax one 
relies on collinear perturbation theory. A standard proce¬ 
dure is to define a b*(6T) such that 


d\^Flflix,bT,yX) 

d In /i 

= 7j.PDF(as(/^); 1) - . 


b^:{bT) 


(br br &max 
^max bx ^ ^max • 


(29) 


(27) 


^ A function that is basically equivalent to K is called —2D 
in and —Fgq in [93]. 
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A smooth transition function is typically used. One of the 
most common is [94] 


The perturbative and nonperturbative fex-dependence 
in Kib'T] jji) is separated by defining 


b,(bT) = b 




1 + 


One ultimately evolves to a scale 
Mb. = Ci/K ■ 


— gK^bT, bmax) = — /To) , (32) 

(30) 

which by construction vanishes like a power at small 6 t. 
In Eq. (l32]l . the scale /iq is arbitrary. 

Solving Eqs. (051) - (IM1) (and the equivalent equations 
for the EEs) to obtain the TMDs at arbitrary scales C, and 

(31) 


J 


AA 


F//p(x,bT;M,C) = XI J 


BB 


xexp'lln- A(6.;/rf,J+/ —- 


7/.PDF(a^i(M'); 1) - In ^jK{as{g')) 


cc 


X exp|-gy/p(a;, &T;&max) - 5 if(^T;&max)ln^| , 


and 


AA 


(^/'^’ ^^b,, fJ'b, , as [gb,))dH/j (■?, Mb.) 


BB 


X exp 


\ Mb. 


K{b*] Mb.) + I 


' iJ-h, 


7/.FF(as(M');l) -ln^7if(Q:a(M')) 


CC 


X exp|-gp//(z,6T;&max) -5A'(^T;&max)ln^| . 


(33) 


(34) 


In practice, one usually sets /i = i/C = Q to enable 
perturbative calculations. 

I have organized the solutions here into three factors, 
labeled “AA,” “BB,” and “CC,” following the method 
of Ref. [SS] to highlight the different components of an 
evolved TMD and connect type I and type II pictures 0 
The AA factor is a fixed order calculation in collinear 
perturbation theory of the small 6 t dependence. The BB 
factor is a perturbative evolution factor for relating scales 
and to general g and C. The CC factor in the 
last line includes all nonperturbative transverse coordi¬ 
nate dependence. Note the MA'(foT; &max) from Eq. (15^ . 
The M//p(a::, 6T;&max) and 5ib//(-2. &max) parametrize 

® I use double letters here to distinguish from other common 
uses of “A,” “B,” and “C.” 


the transition from the OPE calculation at fixed scale in 
Eq. (1^^ to the region where nonperturbative &t depen¬ 
dence is included. I have highlighted the roles of pertur¬ 
bative and nonperturbative behavior by making functions 
that are to be calculated entirely in fixed order perturba¬ 
tion theory red while those that include nonperturbative 
behavior are in blue. Note in particular that m_r-(&t; 6max) 
has no scale dependence, no subscript for /, P or A, no x 
or 2 ; dependence, and is the same function in both Eq. (1551) 
and Eq. (155)) . This emphasizes its strong universality, dis¬ 
cussed earlier. 

The 3 //p(x, 6t; 6max), 9H/f{z, hr] 6max) a.nAgK{bT\ 6max 
functions all show their explicit dependence on &max- Since 
it is an arbitrary cutoff, the overall cross section is exactly 
independent of 6max- If ^max is varied, changes in perturba- 
tively calculated parts should be compensated by changes 
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in gf/p{x,bT;b„,a^), 6t; &max) and gK^bp 5 bmax) • 

This was exploited in Ref. [96) to improve the transi¬ 
tion between the dominantly perturbative and dominantly 
nonperturbative regions of fex-dependence. 

The solutions in Eqs. (I551) - (IM1) are written in a way 
that maximizes the amount of perturbative input, but 
there are other ways of writing solutions that might be 
preferred, depending on the context. This was discussed 
recently in Ref. [35] ■ For example, to match to an exactly 
parton-model-like picture for some initial scale Qo, [961 
Eq. (24)] may be preferred. If one wishes to organize evo¬ 
lution with respect to center of mass energy i/s in Drell- 
Yan, rather than Q, then [96] Eq. (18)] might be preferred. 

Equations (El)-® are written here for unpolarized 
and azimuthally symmetric functions. There are similar 
formulas for other TMDs like the Sivers function EZllSH]. 

7 Comments 

I will end with some general remarks about the outlook of 
TMD applications and work that I believe is still needed. 

One important refinement needed for TMD evolution 
is to include flavor number transitions in the full TMD 
evolution analogous to what is done for collinear PDEs 
in the ACOT [3311100] formalism. Another complication is 
with the implementation of Y-term corrections. Achieving 
a smooth matching between the FE-term and the Y-term 
is more complicated in practice than a straightforward im¬ 
plementation of the definitions in the factorization deriva¬ 
tion |101j . More progress in this area is likely possible with 
further refinements in the details of implementations. 

Improvements in nonperturbative theory in treating 
the TMDs at low Q and low qt will be important for phe¬ 
nomenology since the factorization theorems alone pro¬ 
vide few detailed constraints on the these three dimen¬ 
sional objects. Specific methods will be discussed in other 
articles in this collection. Since TMDs describe inclusive 
processes, including the radiation of soft hadrons, efforts 
to constrain them will benefit from more detailed pictures 
of hadronization and fragmentation. An interesting exam¬ 
ple of fragmentation theory applications to TMDs is the 
use of the string model to describe the Collins mechanism 
in Ref. [102] . My perspective is that accounting for the 
constraints of factorization is critical for guiding the for¬ 
mulation of general pictures of the underlying physics. 

There are by now many other formulations of TMD 
factorization (or frameworks closely analogous to TMD 
factorization), and unfortunately it is not possible dis¬ 
cuss any one of them in detail here. An especially active 
approach in recent years is soft-collinear effective theory 
(SCET). There are at least three versions of TMD factor¬ 
ization that start from the perspective of SCET [103111041 
mm- Another approach to TMD factorization is that of 
Ji, Ma, and Yuan sa¬ 
lt is likely that insight can be gained by determining if 
and how different formulations of TMD factorization have 
meaningful differences, or whether they are actually equiv¬ 
alent formulations with different notation and/or conven¬ 
tions for intermediate steps. In Ref. m. a particular 


version of SCET was shown to be equivalent to the TMD 
factorization approach described in Sects. |4K6| of this arti¬ 
cle, at least to one-loop order. Other issues to consider are 
the small and large x limits, higher twist corrections, and 
the relationship to exclusive scattering. I refer the reader 
to other recent reviews such as [10611107] for discussions of 
some of these topics and relevant references. 

Detailed theoretical considerations indicate that TMD 
factorization should break down in some processes where 
more familiar parton model intuition might suggest that 
it applies [108lll09lll 10111111111211113] . The mechanisms for 
TMD factorization breaking have the potential to pro¬ 
duce interesting physical effects themselves, though more 
work is needed to determine how to calculate them. Eor 
high energies, effects associated with TMD factorization 
breaking are likely calculable in perturbation theory in 
the form of higher order large logarithms and resumma¬ 
tion techniques. 

Finally, TMD factorization is also expected to break¬ 
down in certain kinematical limits. For example, when 
target and hadron masses are important, or when the 
distribution of remnant masses are considered in detail, 
the approximations that give TMD factorization no longer 
suffice and corrections are needed. In such cases, it might 
be necessary to formulate other forms of factorization. For 
example, one might need something more like the fully un¬ 
integrated factorization advocated in Refs. [11411115111161 
111711118] . but probably more closely analogous to TMD 
factorization as it is formulated in Sect. of this review. 

I acknowledge many useful conversations with J. Collins who 
also provided suggestions on the text. I also thank C. Aidala 
and R. Fatemi for helpful comments on the text. This work was 
supported by the DOE contract No. DE-AC05-06OR23177, un¬ 
der which Jefferson Science Associates, LLC operates Jefferson 
Lab. 
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